Random and Synthetic Over-Sampling Approach to Resolve Data Imbalance in Classification
نویسندگان
چکیده
منابع مشابه
Geometric mean based boosting algorithm with over-sampling to resolve data imbalance problem for bankruptcy prediction
In classification or prediction tasks, data imbalance problem is frequently observed when most of instances belong to one majority class. Data imbalance problem has received considerable attention in machine learning community because it is one of the main causes that degrade the performance of classifiers or predictors. In this paper, we propose geometric mean based boosting algorithm (GMBoost...
متن کاملClassification of Imbalanced Data Using Synthetic Over-Sampling Techniques
of the Thesis Classification of Imbalanced Data Using Synthetic Over-Sampling Techniques
متن کاملthe clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance
با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...
data mining rules and classification methods in insurance: the case of collision insurance
assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...
15 صفحه اولovarian cancer classification using hybrid synthetic minority over-sampling technique and neural network
every woman is at risk of ovarian cancer; about 90 percent of women who develop ovarian cancer are above 40 years of age, with the high number of ovarian cancers occurring at the age of 60 years and above. early and correct diagnosis of ovarian cancer can allow proper treatment and as a result reduce the mortality rate. in this paper, we proposed a hybrid of synthetic minority over-sampling tec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Artificial Intelligence Research
سال: 2021
ISSN: 2579-7298
DOI: 10.29099/ijair.v4i2.152